The statistical significance of nucleotide position-weight matrix matches

نویسندگان

  • Jean-Michel Claverie
  • Stéphane Audic
چکیده

MOTIVATION To improve the detection of nucleotide sequence signals (e.g. promoter elements) by position-weight matrices (PWM) using the concept of statistically significant matches. RESULTS The Mksite program was originally developed for analyzing protein sequences. We report NMksite, a new version adapted to the processing of nucleotide sequences. NMksite creates PWM from nucleotide sequence block alignments or occurrence tables using three weight computation schemes. An original feature of NMksite is the numerical computation of the statistical significance of PWM matches. The utility of this concept is demonstrated in the context of the prediction of splice sites and promoter regions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Position Weight Matrix, Gibbs Sampler, and the Associated Significance Tests in Motif Characterization and Prediction

Position weight matrix (PWM) is not only one of the most widely used bioinformatic methods, but also a key component in more advanced computational algorithms (e.g., Gibbs sampler) for characterizing and discovering motifs in nucleotide or amino acid sequences. However, few generally applicable statistical tests are available for evaluating the significance of site patterns, PWM, and PWM scores...

متن کامل

Prognostic Significance of MMP2 and MMP9 Functional Promoter Single Nucleotide Polymorphisms in Head and Neck Squamous Cell Carcinoma

Objective(s) Matrix metalloproteinases comprise a family of enzyme that is able to degrade components of extra cellular matrix. There are single nucleotide polymorphisms in the promoter regions of several genes with ability to influence cancer susceptibility. The aim of this study was to analyses association between MMP2 and MMP9 promoter polymorphisms and head and neck squamous cell carcinoma...

متن کامل

Calculating PSSM probabilities with lazy dynamic programming

Position-specific scoring matrices are one way to represent approximate string patterns, which are commonly encountered in the field of bioinformatics. An important problem that arises with their application is calculating the statistical significance of matches. We review the currently most efficient algorithm for this task, and show how it can be implemented in Haskell, taking advantage of th...

متن کامل

The XXmotif web server for eXhaustive, weight matriX-based motif discovery in nucleotide sequences

The discovery of regulatory motifs enriched in sets of DNA or RNA sequences is fundamental to the analysis of a great variety of functional genomics experiments. These motifs usually represent binding sites of proteins or non-coding RNAs, which are best described by position weight matrices (PWMs). We have recently developed XXmotif, a de novo motif discovery method that is able to directly opt...

متن کامل

The Comparison of Gait Kinematics in Over-Weight and Normal-Weight People across Age Groups

Objective Obesity and overweight have changed to very important factors in people movements in the modern world. Therefore, the present study was carried out to examine the effects of overweight on gait kinematic factors in children, young adults, middle-aged, and older adults. Methods The present study was a causal-comparative study in which 40 participants aged 9-85 were selected based on pu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computer applications in the biosciences : CABIOS

دوره 12 5  شماره 

صفحات  -

تاریخ انتشار 1996